计算机与现代化 ›› 2010, Vol. 1 ›› Issue (5): 24-29.doi: 10.3969/j.issn.1006-2475.2010.05.008

• 人工智能 • 上一篇    下一篇

基于倾斜时间窗口的高效数据流偏向最近聚类分析算法

廖建平,马文龙   

  1. 衢州学院信息与电力工程系,浙江 衢州 324000
  • 收稿日期:2009-12-14 修回日期:1900-01-01 出版日期:2010-05-10 发布日期:2010-05-10

Efficient Recent-biased Clustering Algorithm of Data Stream Based on Tilted-time Window

LIAO Jian-ping, MA Wen-long   

  1. Department of Information and Electric Power Engineering, Quzhou College, Quzhou 324000, China
  • Received:2009-12-14 Revised:1900-01-01 Online:2010-05-10 Published:2010-05-10

摘要: 提出一种基于倾斜时间窗口的数据流偏向最近聚类算法。该算法首先通过将滑动窗口中数据等长分割形成不重叠的数据块——基本窗口,然后对每一基本窗口以Haar小波变换提取窗口数据的特征,通过改变所取各基本窗口小波变换系数个数达到保留较多最近数据细节特征的目的,即对于越近的基本窗口保留越多的小波系数而越旧的基本窗口保留越少的小波系数,最后通过定义数据流偏向最近距离,完成基于倾斜时间窗口的偏向最近聚类算法。该算法计算速度快,能高效地实现数据流偏向最近聚类分析。仿真实验验证了该算法的有效性。

关键词: 数据流, k-means, 偏向最近, 倾斜时间窗口, 聚类分析

Abstract: A recent-biased clustering algorithm of data stream based on tilted-time window is proposed. First, the algorithm segments sliding window equal in length to form no overlap data blocks(basic window), then extracts feature of every data block through Haar wavelet transform, and preserves detailed feature of recent data by varying number of wavelet coefficients of data block, namely more recent data block, more wavelet coefficient preserved, and vice versa. Finally, by defining recent-biased distance of data stream, the recent-biased clustering algorithm of data stream based on tilted-time window is implemented. Remarkably faster computational speed and higher efficiency are achieved by this algorithm. Experiments on real validate the proposed algorithm.

Key words: data stream, k-means, recent-biased, tilted-time window, clustering analysis

中图分类号: